Statistical methods for spoken dialogue management
نویسنده
چکیده
Statistical methods for spoken dialogue management Blaise Thomson Spoken dialogue systems provide a mechanism for interacting with computers that is both natural and effective for human use. This thesis describes a practical framework for building these systems based on the Partially Observable Markov Decision Process (POMDP). The underlying belief state is represented by a dynamic Bayesian Network and the policy is parameterized using a set of action-dependent basis functions. Tractable real-time Bayesian belief updating is made possible using a novel form of Loopy Belief Propagation with various other optimisations and policy optimisation is performed using an episodic Natural Actor Critic algorithm. Parameters for the system’s user model may be learned from an unannotated corpus of dialogues using the Expectation Propagation algorithm with tied Dirichlet distributed parameters. Details of these algorithms are provided along with evaluations of their accuracy and efficiency. The proposed POMDP-based architecture was tested using both simulations and a user trial. Both indicated that the incorporation of Bayesian belief updating significantly increases robustness to noise compared to traditional dialogue state estimation approaches. Furthermore, policy learning worked effectively and the learned policy outperformed all others on simulations. The learned policy was also competitive in user trials, although its optimality was less conclusive. Parameter learning was tested by evaluating the resulting user models’ rescoring ability and by simulating dialogue performance. A range of metrics were shown to improve when using Expectation Propagation to learn a set of parameters and then re-score semantic hypotheses, as shown on both a simulated and a human-generated corpus. Success rates of a system where the parameters were trained from a corpus of dialogues were also shown to outperform the use of handcrafted parameters. Overall, the proposed framework was shown to be a feasible and effective approach to building real-world POMDP-based dialogue systems.
منابع مشابه
On-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
متن کاملOn-Line Learning of a Persian Spoken Dialogue System Using Real Training Data
The first spoken dialogue system developed for the Persian language is introduced. This is a ticket reservation system with Persian ASR and NLU modules. The focus of the paper is on learning the dialogue management module. In this work, real on-line training data are used during the learning process. For on-line learning, the effect of the variations of discount factor (g) on the learning speed...
متن کاملStatistical User Simulation for Spoken Dialogue Systems: What for, Which Data, Which Future?
There has been a lot of interest for user simulation in the field of spoken dialogue systems during the last decades. User simulation was first proposed to assess the performance of SDS before a public release. Since the late 90’s, user simulation is also used for dialogue management optimisation. In this position paper, we focus on statistical methods for user simulation, their main advantages...
متن کاملBayesian update of dialogue state: A POMDP framework for spoken dialogue systems
This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on the partially observable Markov decision process (POMDP), which provides a well-founded, statistical model of spoken dialogue management. However, exact belief state updates in a POMDP model are computationally intrac...
متن کاملReward Shaping for Statistical Optimisation of Dialogue Management
This paper investigates the impact of reward shaping on a reinforcement learning-based spoken dialogue system’s learning. A diffuse reward function gives a reward after each transition between two dialogue states. A sparse function only gives a reward at the end of the dialogue. Reward shaping consists of learning a diffuse function without modifying the optimal policy compared to a sparse one....
متن کاملSpeech understanding, dialogue management and response generation in corpus-based spoken dialogue system
This paper presents construction of a spoken dialogue system using a large-scale spoken dialogue corpus with intention tags. In this system, all of main components, such as speech understanding, dialogue management, and response generation, are constructed with corpus-based methods. An evaluation experiment using a test set has shown that the performance of the corpus-based dialogue system is i...
متن کامل